首页> 外文OA文献 >The Performance of Boolean Retrieval and Vector Space Model in Textual Information Retrieval
【2h】

The Performance of Boolean Retrieval and Vector Space Model in Textual Information Retrieval

机译:布尔检索和向量空间模型在文本信息检索中的性能

摘要

Boolean Retrieval (BR) and Vector Space Model (VSM) are very popular methods in information retrieval for creating an inverted index and querying terms. BR method searches the exact results of the textual information retrieval without ranking the results. VSM method searches and ranks the results. This study empirically compares the two methods. The research utilizes a sample of the corpus data obtained from Reuters. The experimental results show that the required times to produce an inverted index by the two methods are nearly the same. However, a difference exists on the querying index. The results also show that the numberof generated indexes, the sizes of the generated files, and the duration of reading and searching an index are proportional with the file number in the corpus and thefile size.
机译:布尔检索(BR)和向量空间模型(VSM)是信息检索中非常流行的方法,用于创建反向索引和查询术语。 BR方法搜索文本信息检索的准确结果,而不对结果进行排名。 VSM方法搜索结果并对其进行排名。这项研究从经验上比较了这两种方法。该研究利用了从路透社获得的语料库数据样本。实验结果表明,两种方法生成倒排索引所需的时间几乎相同。但是,查询索引存在差异。结果还表明,生成索引的数量,生成文件的大小以及读取和搜索索引的持续时间与语料库中的文件数和文件大小成正比。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号